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ferent priorities from input lines, via receive adapters, to 
output lines via transmit adapters. A spacing function is 
performed to limit the rate of data transfers through the 
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and in the absence of a backpressure signal indicating the 
switch is congested at its output. If high priority data is 
detected or a backpressure signal has been generated, the 
data transfer rate is increased to deal with possible conges- 
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METHOD AND SYSTEM FOR PERFORMING 
TRAFFIC CONGESTION CONTROL IN A 
DATA COMMUNICATION NETWORK 

FIELD OF THE INVENTION 

This invention relates to a method and system for per- 
forming traffic congestion control in a data communication 
network and more particularly to an improvement in per- 
forming such congestion control at network node switch 
level in a packet switching environment 

BACKGROUND OF THE INVENTION 

Effective modern digital networks must be capable of 
operating in a multimedia environment in which they must 
transport different types of data, representing alphanumeric 
files as well as voice, image, video, etc., over the same 
network media. The operation of the networks should satisfy 
dissimilar requirements specific to each of these kinds of 
data. 

Data can be divided into different types. One type is 
non-real-time data, i. e. alphanumeric data that can be 
delivered to an end-user without loss of quality even if 
delayed. Another type is real-time information (such as 
voice or video data) that must be transmitted to the end-user 
without excessive time delays. If the data cannot be trans- 
ferred within the allowable delay period, it is discarded 
rather than allow it to distort reconstructed end user 
information, such as speech or video displays. 

A specific type of non-real-time information is non- 
reserved information such as control information that arrives 
periodically in the network. This type of information has no 
significant rime constraints on its delivery. 

Different techniques have been developed for transporting 
data, such as packet switching techniques where digitized 
data is transported in individual fixed or variable length 
packets, and circuit switching techniques. 

A basic advantage of packet switching compared to circuit 
switching is that packet switching permits multiplexing of 
data of different types or from different sources onto a link, 
which optimizes utilization of the link's capacity. A draw- 
back of packet switching compared to circuit switching is 
that packet switching may result in delayed delivery of some 
packets in the data stream, adversely affecting the quality of 
reconstructed real-time information, such as voice or video 
information, at the destination. It is known to control a 
network in such a way that packet delay is regulated by 
considering the impact of every new connection that is to be 
established across a packet switched network. If a new 
connection would drive packet delay beyond acceptable 
limits, the request for establishment of that connection will 
probably be rejected. For convenience, such a known 
method is briefly described here. 

FIG. 1 shows a sample packet switching network includ- 
ing seven switching nodes SW_1 to SW_7 interconnected 
by trunk lines (TL's), that can be accessed by access lines 
(AL's) connected to outside data terminal equipment 
(DTE's). 

The network control architecture is a distributed one, 
which means that each switching node is controlled by a 
control point CP. All CP's are interconnected through the 
trunk lines via a control point spanning tree CPST which 
provides an efficient means for multicasting control mes- 
sages between the control points. When a CP wants to 
broadcast a message to other CP's in the network, it sends 
this message to the predefined CPST switching address, and 
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the architecture provides the means to route this message on 
every line of the CP spanning tree, and only on these lines. 
This architecture also provides the means to initialize the 
tree address on each switching node, and to automatically 

5 reconfigure the tree in case of line or switching node failure. 
Each CP includes a copy of the topology database that 
contains information about the network. It includes the 
network physical configuration, the line characteristics and 
current status. For every line n in the network, the line delay 
T(n) and bandwidth R _res(n) of this line are defined and 
recorded in the topology database. This information is 
distributed to other control points via topology update 
messages sent over the control point spanning tree whenever 
it presents significant changes. 

5 For more information on such a Spanning Tree 
organization, one may refer to the Copending European 
Patent Application number 94480048.1 filed on 25 May 
1994. 

Also shown in FIG. 1 are user terminal equipment 
2Q DTE_A and DTE_B which are connected to the network 
via access lines AL_A and ALJ, respectively. This equip- 
ment is to be interconnected through the network with a 
given quality of service (QoS) specified in terms of a 
mqxiTrmm allowable delay T_jnax and a packet loss prob- 
23 ability P_Joss. 

At the switching node SW__1, the control point CP_1 
first uses the QoS and the traffic characteristics specified by 
the user (such as peak rate, mean rate, average packet length) 
to compute the amount of bandwidth C_eq. called the 
30 equivalent capacity of the connection, to be reserved on 
every line on the route, in order to guarantee a packet loss 
probability P_l (n) on this line which is much smaller than 
the loss_probabiliry P _loss that has been specified for the 
connection. 

35 Based on the information that is available on a line basis 
in the topology data base, the control point CP_1 then 
computes the best route through the network to the desti- 
nation. The route selection process uses a modified Bellman- 
Ford algorithm to find the minimum weight, m ini mum hop 

40 count, route from the origin to the destination, using avail- 
able lines and satisfying the QoS requirements. 

Algorithms for determining the equivalent capacity 
parameters and the best route for a given message are known 
in the art The algorithms are not essential to a description 

45 of the present invention and, therefore, will not be described 
further. For additional information on the algorithms, one 
may refer to the following publications: 

R. Guerin, H. Ahmadi M. Naghshineh. "Equivalent Capac- 
50 ity and its Application to Bandwidth Allocation in High 
Speed Networks*. IEEE Journal of Selected Areas in 
Communications, JSAC-7* September 1991. 
H, Ahmadi, J. S. Chen, R. Guerin, L. Gun. A. M. Lee andT 
Tedijanto, "Dynamic Routing and Call Control in High- 
55 Speed Integrated Network", Proc, Workshop Sys. Eng. 
Trqf. Eng., TTC 13 pp 397-4C3, Copenhagen, Denmark. 
Assume a selected route is to include switching nodes 
SW_1. SW __Z and SW__3. The origin control point CP_1 
sends a connection set-up message along the route, causing 
60 a copy to be delivered to the control point of every switch 
on the route. This message contains a list of the network 
addresses of the control points on the route, the list of the 
line names between these control points, the requested 
bandwidth C_eq, the priority level of the connection, and a 
65 connection correlator C_cor which is set by the origin 
control point CP_1, and which is used by all other CP's to 
uniquely identify the connection. 
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Upon reception of a copy of the setup message, each 
control point reserves the necessary bandwidth on the trunk, 
creates a new entry in the connection table of the adapter, 
which is used for label swap purposes, and sends an 
acknowledgement back to the origin control point CP_1. 
When CP_1 has received an acknowledgement from every 
CP on the route, it starts the transmission of data over the 
connection. 

For a specified QoS to be met for a connection, the proper 
amount of bandwidth must be reserved on each line on the 
route and, additionally, packets with different priorities, 
must be properly scheduled at each switching point 

FIG. 2 represents a switching node constructed in accor- 
dance with known art The switching node includes receive 
adapters (20) which provide interfaces to input lines 1 
through N and transmit adapter (22) providing interfaces to 
output lines 1 through N. While shown as separate devices 
in the drawing, the receive and transmit adapters might be 
combined into a single adapter device or be implemented by 
means of a programmable processor. A switch fabric (24) 
(also herein referred to as "switch") controls connections 
between the receive adapters and the transmit adapters. 

The switch fabric is schematically represented in FIG. 3. 
II includes an input router section (31) far scanning the 
receive adapters and directing input data to output address 
queues (32) through a shared memory (33). A control section 
(34) is also provided to control the operation of both the 
shared memory (33) and the output address queues (32). 

As shown in FIG. 2, a routing device 26 directs an 
incoming packet into a switch input queue (SIQ) (25) 
located in the receive adapter (20). The switch is assumed to 
be an Asynchronous Transfer Mode (ATM) switch, capable 
of switching ATM (fixed length) packets and variable length 
packets. A routing header for each incoming packet contains 
one bit which indicates whether a packet is an ATM packet 
or a variable length packet. A variable length packet is 
segmented by a receive switch interface RSI into fixed 
length ATM cells before those cells are processed by the 
switch fabric 24. The cells produced by segmentation of a 
variable length packet are switched by switch fabric 24 to 
the appropriate transmit adapter where they are reassembled 
into the original variable length packet form by the transmit 
switch interface XSL Of course, ATM cells received at a 
receive adapter need not be segmented before switching or 
reassembled after switching; mat is, ATM cells can be 
switched natively by the switch fabric 24. 

It should be noted that the term "ATM packet" and "ATM 
cell" may refer to exactly the same data structure; namely a 
fixed length data unit having a multi-bit header structure of 
fixed length and a multi-bit data or payload structure which 
actually contains end user information. 

At the transmit adapter, the packet may enter one of three 
possible queues, according to its priority. Three possible 
traffic priorities are defined, real-time (FT), non-real-time 
(NKT) t and non-reserved (NR). Typically, the highest pri- 
ority class (FT) is used to assigned to delay-sensitive data 
such as voice or video data, the second class (NKT) is 
assigned to delay-tolerant data such as interactive alphanu- 
meric data, and the third class (NR) is assigned to delay- 
insensitive data such as network control information or low 
priority data operations such as bulk file transfers. The 
real-time RT priority level may itself include traffic of 
different sub-priority levels. Upon request from the transmit 
line, a scheduler (27) services the transmit adapter queues. 
More specifically, at every request from the transmit line for 
a new packet the scheduler (27) first looks at the real-time 
queue and services a real-time packet stored in that queue. 
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If the real-time queue is empty, then the scheduler (27) looks 
at the non-real-time queue and services a non-real-time 
packet in that queue. The non-reserved queue is serviced 
only when bom real-time and non-real-time queues are 
5 empty. 

The routing operation at each switching node is imple- 
mented as follows. Switching labels allocated by CP __2 and 
CP _3 (see FIG. 1) will be identified as labels L2 and L3, 
respectively. 

10 At the receive adapter, a network header is appended to 
the data packets that are received from the user equipment 
DTE_A. The network header contains the label L2, and a 
switch routing header that is used locally by switch SW_1 
to route the packet to the desired transmit adapter for 

15 transmission over the line TL_1. At the next switch SW_2, 
the label L2 of each packet is used to access a look-up table 
which returns the label L3 and a switch routing header to be 
used by the switch fabric at SW_Z The new label L3 is 
swapped for the old label L2, and the packet is forwarded in 

20 accordance with the switch routing header to the appropriate 
transmit adapter for transmission on line TL_2. 

The routing operations are repeated at each switching 
node until the packet reaches the destination transmit port 
adapter where the network header Is discarded and the 

25 packet is sent to the user equipment DTE_B. 

If the switch were an ideal switch, that is. an infinitely fast 
switch, the above routing operation, combined with the 
reservation of the equivalent capacity on the transmit trunk, 
would always ensure that a specified quality of service 

30 parameter (QoS) would be satisfied on a connection. In 
practice, however, the speed of the switch is limited, which 
leads to a requirement for some queuing at the switch fabric 
level as described with reference to FIG. 3. 
A switch can route, in parallel, as many cells as the 

35 number of receive and transmit adapters connected to the 
switch. In the switch fabric core, the shared memory is used 
to store cells, and an address queue per output adapter is 
required. At every switch cell cycle, the control section (34) 
looks for an available adapter in a free address table (not 

40 shown) and provides as many free addresses as there are 
available adapters connected. 

When a cell from a receive adapter *T enters the switch, 
it is stored at an address that corresponds to that adapter. 
Once its destination "j" is determined, a pointer to the cell's 

45 storage address is added to the output address queue "f. 
From a performance point of view, two characteristics are 
very important at switch level. One characteristic is a 
speed-up factor, defined by the ratio between internal switch 
speed and the speed of the attached lines. The second 

so characteristic is the size of the shared memory in the switch 
fabric. Different designs can be implemented in order to 
optimize the switch behavior, for example, running the 
switch at a high speed switch or including a large memory 
for the switch 

55 For a speed-up factor of two and assuming the same 
attachment speed on all adapters, with shared memory 
capable of handling only a single cell, a switch could receive 
and forward a cell from each input adapter in a single cycle. 
On the other hand, with an infinitely large shared memory, 

60 a cell from each adapter could also be received and for- 
warded every cycle. In these two cases, the switch would 
have an ideal behavior. 

Ideal solutions such as those described above would be 
expensive to implement, if they can be implemented at all. 

65 A high speed switching medium means expensive high 
speed adapter interfaces. An extremely large shared memory 
is costly to provide. 
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Given the cost and impracticality of ideal solutions, less FIG. 7 is a schematic representation of a buffer manage- 

than ideal solutions involving limited speed-up factor and ment mechanism to be used in this invention; 

limited memory size must be found. If the size of the. shared FIG. 8 is a more detailed block diagram representation of 

memory is limited, contention for the available memory the buffer management mechanism of FIG. 7; and 

spaces may arise. Even if the switch speed is higher than the 5 9 | s a detailed flowchart of an implementation of one 

line speeds, switch resource contention may occur if cells embodiment of the invention. 

from different receive adapters have the same destination „^™.™t^ 

STiriS in thTsame output address queue. H»e DETAILED DES^ONOF PREFERRED 

switch will not have enough time in one cycle to empty its liMBUUHVUiw 1 a 

output queue. After several cycles, the switches* queue/ 10 pjQ 4 j S a schematic representation of a switching node 

memory may become full This case may occur frequently modified to implement the present invention. 

when large packets are processed as such packets are seg- ^ ^ - n ^ receivc adapter is first organized 

mented into smaller fixed length cells, all of which are based on ^ previously mentioned priority criteria, differ- 

directed to the same output entiating between real-time (FT) data, and non-real-time 

In practice, a backpressure signal (BP) can be generated is ^ mcludillg in ^ preferred embodiment as described 

and sent to input adapters when switch congestion is herein puIC non _ rca i_timc data (NKT) and non-reserved 

detected; mat is, when the shared memory is full or the traffic ^cive adapter includes an input routing 

intended output queue is full. The existence of the back- sys t e m (41) with a switching mechanism for dispatching 

pressure signal indicates a cell has to be discarded at the ^ received ^ packet into one of a set of buffers or 

switch and must be sent again. 20 storage queues, with each queue being dedicated to data 

ciTvnor apv hp thf iKTVFNTfON havm « a specific priority level; that is, for handling FT. NKT 

SUMMARY OF THE INVENTION ^ m respectively. A manager module (43) is also 

The present invention relates to performing traffic con- provided to control the traffic to be directed from the receive 

gestion control in a data communication network made for adapter queues (42) toward the switch fabric (24) via the 

transporting data packets of different priority levels through receive switch interface (RSI). Routing and management 

network nodes. Each of the nodes includes a switch fabric operations are preferably controlled by dedicated software 

for switching the network data between receive adapters (e. g. picocode) stored in the system. In addition, the 

connected to network node input lines and transmit adapters previously mentioned backpressure signal (BP), generated 

connected to network node output lines. The switch fabric M by the switch to signal a congestion condition, is also 

includes a switch congestion condition detector and a back- represented in this figure. 

pressure signal generator. The receive adapters include rout- The use of separate queues for KT, NKT and NR data, 

ing means for routing received data into different queues respectively, avoids the situation of a higher priority packet/ 

based on priority levels and for routing the queued data cell being embedded in a data stream following several 

selectively at a predefined adapter speed through the switch. ^ lower priority packets, which can happen if only one input 

In providing congestion control, the queues are monitored to queue is provided. With separate queues for each priority 

detect whether a predefined high priority queue is empty. If level, a maximum of one Iowa priority packet may have to 

the high priority queue is empty, a spacing operation is set be transmitted before a received higher priority packet can 

ON to set receiver adapter speed at a predefined low value. be sent. The maximum queuing delay (and jitter) f or a KT 

If the predefined high priority data queue is not empty, the ^ packet is given by: 
spacing operation is overridden. The switch condition is 

monitored to detect the presence of a backpressure signal. If Maximum ^^^f^ slzc (1) 

a backpressure signal is detected while the spacing operation switch speed 

is ON, the spacing operation is overridden. f or multiple queues, while it is given by: 

BRIEF DESCRIPTION OF THE DRAWINGS 45 Maximum NR/SRTautuc size (2) 

switch s pe ed 

While the specification concludes with claims particularly 

pointing out and distinctly claiming that which is regarded for a single queue. 

as the present invention, further details, characteristics and In practice, the use of three separate priority-based queues 

advantages of preferred embodiments of the invention may 50 can reduce jitter and delay for a real time signal by a factor 

be more readily apparent from the following detailed of 50. 

description when read with reference to the accompanying However, while multiple queuing is one way to limit, 

drawings wherein* within a given receive adapter, the impact of rnixing low 

HG 1 is schematic representation of a data communica- priority traffic and high priority traffic, it does not solve the 

tion network in which congestion control may be practiced 55 switching problem caused where d^erent receive a^pters 

in accordance with this invention; are processing data having different priority levels to pother 

. _ . . v ^ f words, NKT/NR traffic received through adapter "1 may 

FIG. 2 is a scheme <* J^SS traffic received thrc^adapt* T » case 

a switching node of the network of FIG. 1, rf COflgcstioiL 

FIG. 3 is a schematic representation of the switch fabric w Tq e0Surc xmsfci of information and solve the 

of FIG. 2; above-described switch congestion problems, a system con- 

FIG. 4 is a schematic representation of a switching node s tructed in accordance with the present invention is provided 

modified to implement the present invention; with a spacing mechanism such that, under normal operating 

FIG. 5 is a general flowchart of a method for implement- conditions and based on the reserved bandwidth parameters, 

ing this invention; 65 the available switch transmission bandwidth is only partially 

FIG € is a more detailed representation of an algorithm utilized. In other word, the spacing mechanism can be used 

for implementing the invention; to reduce the rate of transfer of data to the switch fabric. This 
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creates the possibility of subsequently* partially or totally, raent is possible where the switch is already operating under 

neutralizing the spacing mechanism to increase the switch congestion conditions. To that end. timers are provided If 

input rate whenever required to deal with switch congestion congestion (as defined earlier) occurs when a NK17NR 

problems. packet is being transmitted a timer is set, and packet 

For example, there will be tunes the switch will be in the $ transmission is completely stopped on the given adapters) 

process of transferring a low priority packet when a higher for a period defined by the timer. Each adapter switch 

priority packet is received. In such a case* the spacing interface manager sets such timers independently, upon 

mechanism can be overridden so that the current low priority reception of the backpressure signal from the switch. Then, 

packet is transferred to the switch fabric as quickly as NRT and NR traffic, which might otherwise increase the 

possible to make way for the higher priority packet congestion period, is temporarily stopped at the receive 

Normally, when a packet is received from the line, it is adapter level, which allows the switch to return to a steady 

stored in a buffer memory up to K kilobytes (K is a system state behavior. There can be two timer values Tl and T2. 

parameter) and is sent to the switch at the adapter internal Timer value Tl is used where an NRT packet is being 

speed, which often corresponds to the switch speed. transmitted when congestion is detected. Timer value T2 is 

A large packet, which must be segmented into several used where an NR packet is being transmitted when con- 
ceils for handling within the switch, will always cause a 15 gestion is detected T2 is preferably larger than Tl to induce 
number of successive cells to be forwarded to the same a "priority" between NKT and NR packets. Typically. Tl and 
output line at a high rate since the switch speed is always T2 values should be between 1 and 10 switch cell cycles. If 
higher than the line speed the timer values are too small, the switch will not have time 

Thus, large low priority (NRI7NR) packets being trans- to return to a steady state and the premature resumption of 

fared on low speed lines may induce switch congestion and 20 transmission of NRT/NR packets will again induce conges- 

negatively impact real-time (RT) traffic performance of the tion. If the Tl and T2 values are too great data may remain 

whole system. in input queues for excessive periods of time before being 

To limit this effect the spacing function is implemented as transferred to the switch fabric. The block diagram of FIG. 

follows. The cells are sent to the switch every X cycles; the 5 shows the operation of such timers, 

value of X mainly depends on receive line speed, and is 25 The operation of the enable timer function 54 is triggered 

chosen in such a way that it verifies the relation: by a backpressure (BP) signal indicating the existence of 

congestion conditions. Whenever an active timer (Tl or T2) 

..Y -r f0 J * l ^|?!lL!f ccd) ^ expires, cell transmission is normally resumed in the receive 

Bac5peod adapter. However, if input FT traffic is detected the timers 

Spacing is implemented only for NRT and NR flows, in part 30 are disabled (block 55) under specific conditions already 

because spacing causes delays in switching. However. NRT/ described and to be described further in connection with 

NR data is not delay sensitive. Also, the larger the packet FIG. 6, and again cell transmission is enabled at full switch 

the more useful the function. Small packets ( 1 cell) can only speed, That is, when RT input traffic is detected, the spacing 

be sent from the adapter at line speed. However, when a function is disabled or overridden, 

large packet is received the train of cells is sent after 35 Details of the process for controlling the spacing mecha- 

reception of the whole packet at adapter speed nism are shown in FIG. 6. First, a test (step 61) is made to 

The spacing function improves the switch behavior and test whether spacing is ON. If spacing is disabled a cell is 

fairness on an overall machine basis, but may decrease RT transmitted normally; that is. at high switch speeds. On the 

performance at a given adapter. To deal with that, the case contrary, if spacing is found to be ON, a test (62) is 

of a RT cell arriving while a NRT/NR data is being spaced 40 performed for the existence of a backpressure signal. If no 

must be considered If the switch interface is multi-user, the backpressure signal is found a spacing counter is set (step 

RT cell will be transferred directly through the switch before 63) to a value based on the selected spacing criteria. Then 

NRT/NR cells due to its higher level of priority. However, if the counter is periodically decremented (step 64) and its 

the switch interface is single-user, the RT cell must wait until content is tested to detect a zero count condition (step 65). 

the end of an NRT/NR packet currently being switched In 45 As long as the count is different from zero, the counter is 

this last case, spacing must be disabled to limit impact of decremented (step 64) unless an override condition due to 

NRT/NR traffic on RT traffic the presence of RT traffic is detected (step 66). If the spacing 

FIG. 5 is a flow-chart of operations which implement a counter count is at zero, or if an override queue condition is 

basic form of the above-described mechanism for control- set a cell is transmitted 

ling switch congestion. Assume a switch fabric can accept 50 If the result of the step 62 test is positive, indicating the 

cells either at 266 Mbps or (with the spacing function presence of a backpressure signal, a double test is performed 

enabled) at 45 Mbps. A receive adapter manager module sets (step 67 and 68). The first test checks whether the type of 

the spacing function ON, enabling adapter operation at 45 input traffic to be halted is NR traffic If it is. a timer 2 is set 

Mbps (see box 51). The receive adapter manager module to a count T2 (step 69). The count in timer 2 is then 

and the its associated receive switch interface (RSI) con- 55 decremented (step 70) and tested (71) in a loop operation 

tinually monitor the input queues for that receive adapter. As which may continue until test (71) shows the count has been 

long as the input data traffic is limited to NRT or NR data. decremented to zero. When a zero count is detected, trans- 

the spacing operation remains ON. However, when input RT mission of the interrupted NR cell is resu me d The loop 

traffic is detected the spacing function is disabled or turned operation including tests (70) and (71) can be overridden if 

OFF and the switch accepts input data at 266 Mbps. 60 input RT traffic is detected in a step (72). As noted earlier, 

A impro ved congestion control mechanism may also be the detection of RT traffic overrides the spacing mcdianisnx 

implemented to take into consideration the amount of input If the result of test (67) is negative, a similar test (step 68) 

traffic stored in various queues levels as compared to pre- is performed for NRT traffic shut-off conditions. An NRT 

defined threshold levels. This feature is described later with shut-off mechanism, similar to the above-described NR 

reference to FIGS. 7 through 9. 65 mechanism goes into operation with a counter set to Tl if the 

Although the basic spacing function already described result of test 68 is positive; mat is. NKT traffic is to be halted 

improves steady-state switch behavior, a further improve- The Tl counter can then decremented in a loop operation, 
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which continues cither until a zero count occurs (permitting 
resumption of NKT cell transmission) or until input RT 
traffic is detected (test 76), requiring an immediate override 
of the spacing function. 

In all instances* when a real-time queue override condi- 
tion occurs, the switch response, as already noted, depends 
on whether the switch interface is multi-user or single-user. 
A single-user switch interface prohibits interleaving of cells 
from different packets during their transmission through the 
switch when issued from the same adapter. For a multi-user 
switch interface, cells of different packets issued from a 
given adapter can be interleaved. In a multi-user 
environment when different classes of traffic are supported, 
a packet with a lower priority can be preempted by a higher 
priority packet In such a case, some cells from one packet 
are transmitted, men cells from another packet and so on. 

Accordingly, when a KT cell/packet arrives in an adapter 
during the transmission of a NRT/NR packet to the switch, 
the spacing mechanism is disabled In the multi-user case, 



The routing device (see 41 in FIG. 4) is represented as 
including a Line Interface Adapter (81) performing conven- 
tional packet reassembly operations, whereby DMA data 
and end of message (EOM) signals are prepared and for- 
5 warded to a Message Reception Control (MSGE Reception 
CTRL) device (82) wherein the message is controlled prior 
to being forwarded to a Message Routing device (83). If the 
message includes errors, it is discarded by the message 
routing device (83); that is, is not forwarded to the switch . 
io The message may be addressed to the local node, in which 
case it should also not be forwarded to the switch. That may 
be the case for a network control message destined for the 
local node in the course of a connection set-up or in topology 
information broadcast 
15 Receive adapter queuing (see 42 in FIG. 4) has been 
implemented in the system of FIG. 8 to process non-real- 
time (NKT), non-reserved (NR) and real-time (KT) traffic, 
with real-time traffic classified as having either KT1 and KT2 
priority, with the highest priority being assigned to RT1, next 



the KT cell is transferred directly to the switch fabric before 20 highest priority being assigned to KT2. then NKT and finally 
any additional NKI7NR data. In the single-user case, the RT NR. A counter (NO_RT Level) (84) connected to both NKT 
cell is not transferred until transmission of the current and NR queues is used to indicate the total number of 
NRT/NR packet is completed message bytes in both NKT and NR queues. This counter is 

The spacing function is required for large packets. For incremented at message reception (enqueue time) and dec- 
small ones, the spacing function may be ineffective and may 25 remented at message transmission (dequeue time). 



even create congestion at the input queue level. For a line 
speed of 4 M* bps. carrying V bytes packets, the equivalent 
switch throughput (ST) necessary at switch level is 



ST=((M*SCS)lbW/Sf*Lyr 



(4) 



where 



b^MVN 



TXT is the smallest integer larger than x, 
SCS is the size (in bytes) of the switch cell 



While all enqueuing (ENQ) operations are controlled by 
the message routing device (83), dequeuings are represented 
as being performed separately in an FT Spacing CTRL 
device (80) for both KT1 and KT2 traffic and in a device 
30 NO_KT Spacing CTRL (85) for NKT and NR traffic One 
should however, understand that these operations are all 
performed as already mentioned, in the programmable 
processor designated as P3 processor. The NO_RT Spacing 
CTRL device (85) is connected to an SP_CTRL register 
35 (86) containing the values of spacing (coded 0 through 4) to 
be used for sending Non-real-time messages to the switch. 
In the preferred embodiment of this invention, "(T corre- 
sponds to full speed (i, e. 266 Mbps), and "1", "2". *'3 M and 
4" correspond to 4 /s, 3 /s, Vs and Vs of full speed respectively. 



SPL is the maximum number of bytes issued from the 40 As already mentioned, real-time traffic shall be transferred at 



adapter, transmitted in a single switch cell (switch 
payload) car in other words, the useful portion of SCS. 
OVH is an overhead representing the number of bytes 
added to the packet issued from the line in the input 
adapter (CRC, routing information, . . . ) 
For example, assuming a line operating at 52 Mbps. an 
overhead of 14 bytes, a switch payload of 48 bytes and a 
switch cell of 58 bytes, the equivalent switch throughput 
ranges from 430 Mbps to 63 Mbps for packet sizes ranging 
from 7 to 2048 bytes. It is clear that for a small packet size, 
the expression for x must be reduced to 1, while the formula 
given previously fits for large packets. 

In order to avoid this problem, the previously mentioned 
threshold controlled mechanism can be implemented in the 



4t 0" spacing (see 87). The spacing values U 0 M through "4", 
stored into (86) or (87) are to be set into a switch interface 
I/O register SP_IOREG (88). Also, as already mentioned 
four threshold values (numbered TH1, TH2, TH3 and TH4) 
45 have been defined and stored into TH' s Registers (89). In the 
preferred embodiment, these thresholds have been selected 
to be set at 10 Mbps, 15 Mbps, 20 Mbps and 25 Mbps for 
TH1. TH2, TH3 and TH4 respectively: But means are 
provided for enabling the network designer to configure 
50 these values at will. 

The queued data is dequeued through a "transmit mes- 
sage** control device (XMTT MSGE CTRL) (90) in DMA 
form, and then forwarded to the switch via a register (DATA 
IOREG) (91) located in the receive switch interface adapter. 



adapter: switch input queues sizes are monitored and as soon 55 The rate for transferring the data from the DATAJOREG 

as the number of cells queued reaches a threshold Tel, to the switch is controlled by the SP_IOREG content. In 

spacing is disabled; when a threshold Te2 is reached, the other words, enabling and disabling of spacing is controlled 

spacing is enabled again. FIG. 7 represents this mechanism. by register SP__IOREG. 

In a preferred embodiment of this invention, threshold Represented in FIG. 9 is the spacing control flowchart as 

controlled spacing mechanism has been improved by being 60 implemented to take into consideration the four threshold 

implemented with mere than two specified threshold levels, levels as defined above. Upon initial entry, the total number 

with a predefined spacing ratio being assigned to each of bytes present in both NKT and NR queues is zero and a 

threshold condition. flag (Xmit_active flag), indicating whether a message is 

FIG. 8 is a block diagram showing an implementation of being transmitted to the receive switch interface (RSI), is set 

the threshold controlled spacing mechanism using four 65 OFF to indicate that no message is being transmitted (step 

threshold levels. This figure also provides a more detailed 92). The message reception control is exercised to test 

description of the receive adapter shown generally in FIG. 4. whether a message is received from an input line, In case of 
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negative answer, the NRT/NR spacing mechanism is set-up monitoring the input buffers in each receive adapter for 

(step 93) to its desired value by testing the No_RT_Level the presence of received packets having a predeter- 

with the following algorithm: mined priority level; 

IF No_KT_tevel>TH4-»Sp_CrRL=0 (Full speed) enabling a spacing function in the absence of received 

ELSEIFNo_RT_Level>TH3-»Sp_CrRL=l(% speed) s packets having the predetermined assigned priority 

ELSE !FNo_RT_Lcvel>TH2-»Sp_CniL=2 (% speed) leveL said spacing function causing received packets 

ELSE IF No RT Level>THl-»Sp CTRL=3 (% speed) having other than the predetermined priority level to be 

rjr - B .«„ 7-r»7=d iv, «v*rf^ transferred to the switch fabric at a transfer rate lower 

iiLitJ-*sp_yiKL=*ty3 ^eeo; ^ ^ predetermined packet transfer rate of the 

If a message is received from the line, a message routing 7^*? 

process is started (94) by first checking whether it is a to sw " cn • ™° ... 

real-time message. If this is the case the message is spacmg function upon detection of the 

enqueued into KTl or KT2 buffer according to the received presence of packets having the predetermined pnonty 

data, and the switch interface I/O register (Sp_JORBG) leveJ . «o tnaease the packet transfer rate to the prede- 

content is forced to "0" (step 95) and the process goes to the „ t ?^™J*7 set t ?f? sfcr rate - 

step wherein the flag. inSng whether a messages is being l5 . 2 , A . P™**** congestion control as set forth 

transmitted to the^itch interface, is tested. Otherwise, tf in . eLum 1 whe ^ eui *f I«^^ ,mnc f I""** ' CV ?. 1S 

the received message is not a real-time message, it is i- ^ ^ f£ 

enqueued to NRT orNR queues and the counter No_RT_ . 3 ; A P^^i congestion control as set forth 

Level is incremented (step 96). The process then goes lo step » ckun 2 7?**™ VT* ^ 

93 and then to a test which checks whether the Xmit^Vctive ^ three possible priority levels, two of which are lower than 

flag is ON. If the flag is ON. the process checks for end of 20 ** predetermined pnonty level and wherein there are at 

transmission, ff this is not the case, the process loops back least roo P**<* transfer rates lower man * e predetermined 

to checking whether a message is received from the line. In packet transfer rate. 

case of end of transmission, the Xmit_J\ctive flag is set OFF 4. A method of providing congestion control as set forth 

(step 97). Once this flag is set OFF or should this setting in claim 3 including the additional steps of: 

OFF be indicated by the test performed on said flag, a test 23 defining a threshold level for each of the input buffers in 

is performed on BT1/RT2 queue contents. If both are empty. the receive adapter; 

men a test is performed on NKI7NR queues to see if these monitor the input buffer contents to detect when the 

queues are empty. U they are. the process checks whether a contents of an input buffer exceed the defined threshold 

message is received from the line. If either of the KTl or level . ^ 

f\i U ^»4Jf e ^* e ^ Ue .^ Wte SL e ^ OUt " d M ""bling the transfer of packets at a rate lower than the 
* C l P -J2S E °x^ nteatS lS *° T (St ? ^ P^™* 6 ; predetermined packet transfer rate by allowing the 
rf QtJ ^ NKr " ^ <,U tJ e l L 1S • De f facing function* be enabled only while the buffer 
NFT/NR message would be extracted from the correspond- remain below the threshold level, otherwise 
ing queues), the ^content of Sp CIIU. register would be ^ ^ m ^ ^ packets to be 
transferred, mo SPJOREG and the counter No_JjT_Level 35 ^fe^ at the predetermined packet transfer rate, 
content would be decremented by the considered message 5 . For use in a data communications network wherein 
length (step 99) Both steps 98 and 99 connect to step 10» ^.^^ m ttansported between network 
where a flag indicating ^hether a message is being trans- ^^^^^ by transmit lines capable of trans- 
mitted to the switch interface is set ON and transmission to p^gp^^ * predetermined line rates and wherein one 

me „! W ? td l Star ^' . , , t ... . 40 of more of the network nodes includes a mumpart switch for 

Whfle there have been described what are considered to swium ^ ^ receivcd 0Q my of a piurtUty of input 

be preferred embodiment: of the invention, variations and ^^tnyof a plurality of output lines at packet transfer 

rnodifications to the preferred embodiments wm <xxur to ^ ^ ^ pacfceYtransfer rate, 

those sfallcd in the art once mey learn of the basic inventive ^ improved method for providing switch congestion control 

concepts. Therefore it is intended that the appended claims 45 ^ ^ 

shall be construed to include _the pref erred ««°bc^ments and swi ^ to pac^ts at a rate less than 

all such variations and modifications as fail within the true rat£ 0 ^ y wnen D0 ^ priority 

^ Wh" is S cSmed to* mV packets are detected at the input to the multiport switch; 

1, For use in a data communications network wherein data so , * n< ' . , , , 

packets having assigned priority levels are transported mcrcasing the packet transfer rate to the ma^um packet 

between network nodes iSerconnected by transmission ^Zt'TJ ™ ?l ^JZ^X I 

linksand wherein one ormoreof the network nodes includes ^ * t0 tec ^ ltch ; fc ^ 

a multi-port switch for switching packets received on any of * Fot use in a data communications network wherein 

a pluraUty of input lines to any of a plurality of output lines, 55 ^gh pnonty level packets and lower pnonty level pactets 

said switch comprising a pluraUty of receive adapters, each are transported between network nodes over interconnecting 

of which is connected to an input line and includes multiple transmission lines, an improved miiitiport switch located in 

input buffers for enqueueing received packets as a function at ! east one °f me ^ 0lk nodes ^ said improved multiport 

of the priority levels of those packets, a plurality of transmit swltch com P nsin S : 

adapters, each of which is connected to an output line, a 60 aplurality of receive adapters, each of which is connected 

switch fabric for directing a packet provided from one of the to an input line and includes multiple input buffers for 

multiple input buffers in one of the receive adapters to one enqueueing received packets as a function of the pri- 

of the transmit adapters, said switch fabric being capable of ^ty levels of those packets; 

transferring packets from receive adapters to transmit adapt* a plurality of transmit adapters, each of which is con- 

crs at a pr^termincd transfer rate, an improved method of 65 nected to an output line; 

providing congestion control in the switch comprising the a switch fabric for directing a packet provided from one 

steps of: of the multiple input buffers in one of the receive 
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adapters to one of the transmit adapters, said switch 
fabric being capable of transferring packets from 
iccc i ve adapters to transmit adapters at a predetermined 
transfer rate; and 5 
a manager module for causing enqueued packets to be 
transferred from the input buffers to the switch fabric at 
a rate not exceeding a first packet transfer rate provided 
no high priority packets are enqueued in the input 
buffers, said manager module causing enqueued pack- 10 
ets to be transferred from the input buffers to the switch 
fabric at a second packet transfer rate higher than the 
first packet transfer rate if any high priority packets are 
detected in the input buffers. 
7. An improved multiport switch as set forth in claim 6 15 
wherein the lower priority level includes at least two dif- 
ferent low priority levels and wherein the number of pos- 
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sible packet transfer rates which may be selected by the 
manager module equals the number of possible packet 
priority levels. 

8. An improved multiport switch as set forth in claim 6 or 
claim 7 further including 
buffer registers for storing threshold values representing 
upper limits on the amount of data to be enqueued in 
each of the multiple input buffers; 
a threshold detector for producing a limit signal when the 
amount of data enqueued in one of more of said 
multiple input buffers has exceeded the applicable 
threshold value; and 
means responsive to the limit signal to increase the rate at 
which packets are transferred from the receive adapter 
to the switching fabric 
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